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Abstract — Our paper reanalyzes data trom the classic l%(> stud> Equality of Educational Opportunity. 
or Coleman Report. It addresses whether teacher characteristics, including verbal ability and race, 
mtkicmcd 'synthetic gam scores" ot students (mean test scores ot upper grade students in a school 
minus mean lest scores ot lower gtade students in a school), in the context of an econometric model that 
allows for the possibility that teacher ch.iiactcristics in a school are endogenously determined. 

We lind thai verbal aptitude scores ot teachers influenced synthetic grain scores for both black and 
white students. Veibal aptitude matte ted as much for black teachers its it did for white teachers, ttnally. 
holding leachct characteristics other than race constant, in some specifications black teachers were 
associated with higher gain scores tor black high school students, but lower gain scores for white 
elemcnlarv arid secondary students. Because these findings ;ire for American schools in the mid-1960s, 
they do not directly apply to our contemporary experience. However, they do raise issues that should be 
addressed in discussions of luring policies in American education. [J EL 131) 



I. INTRODUCTION 

Mom.MLU b\ the poor academic performance and 
high drop-out rates of many minority elementary 
and secondary school students vis-a-vis their white 
counterparts, as well as the fact that the tacial/cthnic 
distribution of public school faculty often does not 
reflect the racial/ethnic distribution of their 
students, many school districts have aggressively 
sought to increase their hiring of minority faculty. 
This policy has been purchased even in the face ol a 
declining pool of minorities seeking to enter careers 
in education and evidence that new minority 
teachers tend to fail the National Teacher 
Examination at a higher rate than new white 
teachers. 1 Confronted by fiscal stringency, many 
school districts base also begun to institute early 
retirement plans to encourage older, mote experi- 
enced, and often white, teachers to retire, thereby 
creating vacancies for lower paid new. or relatively 
inexperienced, minority teachers. 
These policies raise a host of issues. Minority 



teachers are thought by many to be more effective 
teachers of minority students because the former 
may serve as role models for. may interact better 
with, may have more favorable attitudes towards 
and higher expectations for. and may provide more 
positive feedback to, minority students. 2 Ulti- 
mately, however, society must be concerned about 
minority teachers' impacts on the educational (test 
scores, completed schooling levels) and post- 
educatioral (labor market success) outcomes of both 
minority and white students. Only if minority 
teachers improve, or at least leave unchanged, the 
outcomes of both groups (as compared to what 
while teachers would generate) can Minority recruit- 
ment policies in public education be judged pareto 
optimal in terms of their impacts on students. If they 
improve the outcomes for minority students but 
reduce the outcomes tor white students, the debate 
ov ci these policies will shift to their distributional 
(across student group) consequences. If minority 
teachers are show n to have no impact on the 
outcomes lot mtnotitv students and to adversely 
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influence those for white students, the debate will 
shift to one over the importance society places on 
providing employment opportunities for minority 
teachers to help remedy historical inequities and 
perceptions of current discrimination against poten- 
tial minority teachers. 

Of course, minority and white teachers differ, on 
average, on a number of dimensions other than 
race. They come from different socioeconomic 
backgrounds, have different levels of experience, 
have different degree levels, and tend to have 
received their degrees from different institutions. 
They also tend to score differently on standardized 
aptitude and achievement tests. While issues relat- 
ing to the "cultural bias" in test scores have been 
raised, studies do suggest that student s academic 
performance is related, on average, lo their 
teachers' performance on standardized tests. ' 
Comparisons of the effectiveness of minoritv and 
white teachers must control, if possible, for these 
other characteristics. 

Research on the relative effectiveness of minora \ 
teachers has been conducted primarilv b> soci- 
ologists, psychologists, and educational researchers. 
Most studies have focused on teachers" attitudes, 
teachers' expectations, teachers" placement ot 
students, and the feedback (positive and ncgali\c) 
that teachers provide students. J Onlv a few ha\c 
addressed educational outcomes ami none has 
addressed subsequent labor market success/ Main 
have been studies of a single school district and 
these typically failed to control for other teacher 
characteristics. Only a few studies used represen- 
tative national data bases, only a few attempted to 
model the process bv which teachers gel assigned to 
schools, and none controlled lor this process in the 
estimation of teacher effects. To our knowledge, 
none addressed whether the effects of teachers* 
verbal ability vary cither with the race of the 
teachers or the race of the students thev arc 
teaching. 

Our paper begins to address some of the issues we 
have raised, by reanalyzing data from the classic 
1966 study Equality of Educational Opportunity, or 
Coleman Report. As we describe in the next section, 
these data permit us to estimate how. during t l e 
mid-1960s, the characteristics of teachers of de- 
ferent races (verbal aptitude, degree levels, vears of 
experience) influenced an estimate of the change in 
test scores over a three grade le\el period, for 
students of different races. Thev also pet nut us to 



test whether controlling for the process by which 
teacher characteristics (including race) get assigned 
to different schools influences our estimated 
relationships. 

After discussion of the Coleman Report in the 
next section, sections III and IV present our 
empirical analvses. The two final sections then 
discuss the significance of our findings and their 
implications tor future research. 

II, THE COLEMAN REPORT 

The Coleman Report represented an important 
step in educational research. Its statistical analyses, 
based on data from o\cr 570.UKI pupils. 60.1HKI 
teachers, and 4.0(1(1 principals, represented the 
beginning of the "educational production function"' 
literature. 

The methodological approaches used in the 
( 'oieman Report were scvcrclv criticized and 
numerous reanalvscs of the data look place within a 
few vears of the Repot i\ publication." Most social 
scientists, and public attention, focused on its 
conclusions concerning the extent of race segre- 
gation in schools and the importance of lamilv 
background characteristics in explaining sanations 
in student achievement. I. ess \vell known (or well 
remembered) is that the iiiulerlving data set con- 
tained information on teacher verbal abilitv (as 
measured bv scores on a verbal aptitude test) and 
that the average verbal aptitude ot teachers in a 
school was seen to be positively correlated with 
student test scores. Both the original Coleman 
Report and subsequent reanalvscs of its data found 
this coi relation and sonic researchers concluded that 
the correlation appealed stronger at higher grade 
levels/ 

The Coleman Repot t data appeal to be unique 
among existing micro level data sets in that thev 
contain a measure ot individual teachers' verbal 
abilitv. One serious weakness of the data, however, 
is that thev represent a "snapshot" at a single point 
in time and that onlv a current vear test score 
measure exists for each student. Subsequent edu- 
cational research bv economists has stressed that to 
more fully control for unobservable student, tamilv. 
and communitv c haracteristics that influence 
student achievement, one should relate school 
characteristics, including teacher abilitv levels, to 
student gain scores, or changes in test scores over 
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time — not to student test seore levels at a point in 
time/ 

While all of the prior researeh that used the 
Coleman Report data estimated current year test 
score equations, the data do in fact contain infor- 
mation on third and sixth graders at each elementary 
school and ninth and twelfth graders at each high 
school. Moreover, one can identify the subset of 
third and sixth graders who spent their entire 
elementary school careers at a given elementary 
school and similarly identify the subset of twelfth 
graders who spent their entire high school careers at 
a gisen high school. Restricting one's attention to 
these subsets of student respondents, computing 
mean scores for each school by grade, and assuming 
that within a school, the unobservable character- 
istics do not vary across grades, one can compute 
•synthetic" gain scores as the difference between the 
mean :est scores in the two grades for each school. 
For example, the difference between the mean test 
scores of sixth graders in a school and the mean test 
scores of third graders in a school at the survey date 
can be taken as an estimate ot how much third 
graders in the school would learn if they remained in 
the school for three more years. 1 ' In eases where the 
schools have a significant number ol both white and 
black students, these gain scores can also he 
computed separately for each racial group. 1 " 

These gain scores are used as dependent v ariables 
in the next section in the estimation of educational 
production I unctions in w hich the gain scores by 
school ate related to student lamily, community, 
school, and teacher characteristics. Of primary 
interest to us will be the effect of the racial 
composition of teachers in a school and their verbal 
abilities on the gain scores of students of each racial 
group." (iisen prior mentioned concerns about the 
alleged "cultural bias" of tests, we also address 
whethei incteasing the serbal ability of teachers of 
each race has the same impact on the gain scores ot 
students o! different races. 

While conceptually such an analysis is straight- 
lorwaul. an important statistical issue exists. 
I cachets are not randomly assigned to schools and 
school districts; teachers with higher test scores may- 
be more easily attracted to higher paying districts, 
districts with smaller class si/es. and districts whose 
families ate highly educated. Similarly, teachers 
may prclcr to work with students who come from 
the same racial group or from similar socioeconomic 
hackmouiuls. To make sure that such nonrandom 



assignment does not lead to biased estimates of the 
effects of teacher characteristics on student gain 
scores, an instrumental variable approach is 
employed in section IV to control for the process by 
which teachers and schools are matched. 12 

III. ESTIMATING SYNTHETIC GAIN SCORE 
EQUATIONS 

A. Descriptive Statistics 

The Equality oj Educational Opportunity (EEO) 
data tapes that we received from the National 
Archives contained data for third grade students at 
2499 schools, sixth grade students at -389 schools, 
ninth grade students at 930 schools and twelfth 
grade students at 787 schools. lX We restricted our 
attention to the subset of elementary schools for 
which data were reported for both elementary 
grades and the subset of high schools for which data 
were reported for both secondary grades. We also 
required that data for each school were reported on 
all of the explanatory variables used in the analyses 
that follow, including the characteristics of 
teachers. 1 " 1 All data were aggregated to obtain 
school level mean values for the entire sample, for 
white students at the subset of schools that had some 
white students in attendance, and for black students 
at schools that had some black students in 
attendance. l * 

As Table 1 indicates, the restrictions left us with a 
maximum sample of 969 elementary schools and 256 
high schools. lh Of the former. 799 had at least one 
white student in both grades and 514 had at least one 
black student in both grades. Of the latter. 178 had 
at least one white student in both grades and 183 had 
at least one black student in each grade. Because the 
elementary school sample sizes are so much larger, 
the majority of the analyses that follow use the 
elementary school data. 

Students in each grade were administered verbal 
aptitude, nonverbal aptitude, reading and math- 
ematics tests. The weighted (by number of students 
taking the tests) mean percentage of correct answers 
on the four tests across schools was 58.18 for third 
graders and 52.28 for ninth graders. The weighted 
mean synthetic gain scores, the mean for the sixth 
grade minus the mean for the third grade and the 
mean for the %vclth grade minus the mean for the 
ninth grade. \ :re 1.55 and 0.90. respectively. 

At the elementary school level 31 % of the 
students and 27% of the teachers were brack, while 
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Table I. Descriptive statistics: weighted means (by schcx^l size) 
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mean percentage of correct answers of third grade students in the school on the group ol test** (ninth 
grade tor high schools) 

mean percentage of correct answers ot sixth grade students in the school on the group of sixth grade tests 
minus the mean percentage ot correct answers of third grade students in the school on the gioupol third 
grade tests (twelfth and ninth grades for high schools) 

percentage of black students in the third and sixth grades in the school (ninth and twelfth tor high 
school) 

percentage ot black teachers in the school 

mean sears of teaching experience ol teachers in the school 

mean percentage of teachers in the school with ;it least a master's degree 

mean percentage ot correct answers of teachers in the school on the verbal test 

mean values tor black teachers in school 1 with positive numbers ot black teachers m the sample 
mean vaiucs for white teachers in schools with positive numbers of white teachers 



proportion ot schools with at least one black teacher 
proportion ol schools with at least one white teacher 
number of schools 

•'mean tor white students is significantly different I mm the mean lot black students at the 0.05 level ol significance, two- 
tail test. 

Source: Authors' computations from the t\<puilnv of i.dui ational Opportunity survcv data tapes. 



at the high school level the comparable percentages 
were 29 and 25. 17 Elementary school teachers in the 
sample averaged close to I ft years of teaching 
experience, about 17"/,, of them had earned at least a 
masters degree and. on average, they answered 
correctly slightly more than 75°/,, ol the questions on 
a verbal aptitude test that was administered to them. 
High school teachers were quite similar on their 
experience and their verbal aptitude scores, but over 
34% of them had at least a n .islet v degtee. 



Table I also summarizes these data separate!) lor 
white students and lor black students, indicating in 
each case how their teachers* characteristics varied 
b\ race. White students tended to have higher base 
year test scores ami larger gain scores than black 
students at both the elemental) and sceomlai) 
levels.' N The typical white elemental y school 
student was enrolled in a school in which about (>"». 
ol the students and 4 n n of the teachers woe black, 
while the typical black elemental) school student 
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found 77% of his or her classmates and 71% of his 
or her teachers black. The respective values for high 
school students were quite similar. 

Both white and black teachers' verbal aptitude 
scores tended to be higher if they were employed at 
schools in which white students were enrolled than if 
the\ were employed in schools in which black 
students were enrolled. In addition, white teachers* 
verbal aptitude scores were higher than black 
teachers* verbal aptitude scores in both types of 
schools, although at the high school level the 
ditlcrencc at schools in which white students were 
enrolled was quite small. Experience and degree 
differences across the two types of schools and. 
within a tvpe. across the two types of teachers were 
less uniform and were not always in favor of white 
teachers. For example, in both elementary and 
secondary schools at which white students were 
present, black teachers were more likely than white 
teachers to have advanced degrees. 1 " 

B. Elementary School Analyses 

Estimates are presented in column (1) of Table 2 
of synthetic gain score equations of the form, 

GAIN, = a„ + ct,X, + a : S, + ouBYTEST, 

+ a/V, + (1) 

Hcie X, represents a vector of characteristics of 
school i\ students, their families, and the eoni- 
niunitv in which the school is located; S, represents a 
vector ot characteristics of the school: and T, 
represents a vector of characteristics of the school s 
teachers. BY TEST, is the average test score ot grade 
3 students in the school and (JAIN, is the difference 
between the average test score of grade 6 students in 
the school and the average test score of grade 3 
students in the school. 

Included in X, are the percentages of the school's 
students that are female (FKM). black (BLACKS), 
have no lather or no mother in the home (FNHH. 
MNHH). have a telephone in the household 
(PHONE), and receive free lunches (FLNCH). the 
mean income of the families of the school's students 
(INCOME.), the mean education levels of the 
lathers and mothers of the school's students (FED. 
MI D), and whether the school is located in a 
central citv (( I'l'Y). rural (RURAL), or suburban 
(the omitted category) area :n The school character- 
istics are the number of books per pupil in the 
school's library (BOOKS) and the pupil/teacher 
tatio m the school (PUP1). Finally, the teacher 



characteristics are the percentage that arc black 
(BLACKT). the mean years of teaching experience 
(EXP), the percentage with at least a masters degree 
(MA), and the mean verbal test score of teachers in 
the school (VERB). Since the schools in the sample 
vary considerably in size, the method of weighted 
least squares is used to obtain the estimates. 

Gain scores prove to be higher in schools with a 
greater percentage of female pupils, a smaller 
percentage of bkick students, fewer families with 
only one parent in the household, more families 
with telephones, fewer families receiving free 
lunches, and higher parental education levels. 
Relative to suburban schools, gain scores are higher 
in rural schools and lower in central city schools. 
Higher pupil/teacher ratios are associated with 
lower gain scores. Finally, "regression to the mean" 
is present, as higher base year test scores arc 
associated with lower gain scores. 

Of primary concern to us is the role that teacher 
characteristics play. In this model, increasing the 
percentage of black teachers by 10 percentage points 
is associated with a 0.4 decrease in the school's gain 
score. Teachers* experience is positively associated 
with the gain score, but teachers' degree level does 
not appear to matter. Crucially, higher verbal 
aptitude scores for teachers are associated with 
higher gain scores for students. If teachers' verbal 
aptitude scores could be increased by 10 percentage 
points, gain scores are predicted to be 0.9 points 
higher. 21 The latter should be contrasted to a mean 
gain in the sample ol 1.55. 

The remaining columns in Table 2 ascertain the 
sensitivity of these results to changes in the model's 
specification. While inclusion of the base year (third 
grade) test score on the right-hand side of equation 
(1) is justified because how much students learn 
over time depends on where they are starting trom. 
there are well-known statistical problems that result. 
On the one hand, if the base year test score and 
the gain score arc both influenced by a common set 
of variables and any of these variables are omitted 
from equation (1). then inclusion of BYTEST may 
lead to biased estimates of the coefficients of other 
variables in the model." On the other hand, if 
BYTEST measures students* true abilities with 
error, its coefficient will be biased towards zero and 
thus our estimate of the extent of regression to the 
mean overstated. 

One way to handle this problem is to omit 
BY I I.ST from the model and see whether this 
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Table 2. School level synthetic gain score equations: Grades 3 to 6 J> (absolute value / statistics) 
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-0.894 


d-9) 


1.062 (1.9) 


-0.876 


(l.S) 


■0.677 


(1.2) 


0.395 


(0 i\ 


RURAL. 


l.OSO 


(2.3) 


1.017 (l.S) 


1.052 


(2.2) 


0.915 


(l.S) 


0 S29 


(1.6) 


BOOKS 


0.010 


(0.3) 


0.015 (0.3) 


0.012 


(0.3) 


0.013 


(0.3) 


0.012 


(0.3) 


PU FI- 


-0.081 


C3) 


0.002 (0.0) 


-0.07ft 


(2.2) 


- 0.076 


(2.0) 


0 061 


(1.6) 


ST AY 








0.(K)9 


(1.0) 






o 030 


(2 7) 


BYTES!" 


-0.638 


20.4) 




-0.634 


(20.0) 


■0 653 ( 


19.9) 


0 6*6 


(20 0) 


BLACK F 


-(1.042 


(3.S) 


- 0 060 (4 9) 


■■ (1.(145 


(3.9) 


■0.042 


(3.(i» 


(1 045 


(3 2} 


EXP 


0.06S 


(2.0) 


-0.023 (0.5) 


0.063 


(l.S) 


0.058 


(1.5) 


0 046 


(1 2? 


MA 


-0.004 


(0.3) 


0.006 (0.5) 


-0.003 


(0 3) 


-0.001 


(0 I) 


0 (KM 


(O 1) 


VERB 


0.093 


(4.1) 


0.0d3 (2.4) 


0.09ft 


(4.2) 


0.076 


(3 1) 


II ( in 1 


( > 2i 


// 


969 




W) 


953 




1 >2S 




92s 






0.728 




0.616 


(1 727 




(I.69S 




(1 "Oil 





where: 

INTER intercept term 

FEM percentage of the school s students that are female 

BLAC KS percentage ot the school's students that arc black 

FNHH percentage ot the school's students without ;i lather in the household 

MNHH percentage ot the school s students without a mother in the household 

PHONE percentage of the school's students with a telephone in the household 

INCOME! mean income ot the families of the school's students (in thousands) 

FLNCH percentage of the school's students that receive free lunches 

LED mean \ears ol education of fathers ot the school's students 

MED mean vears of education ot mothers of the school's students 

CITY 1 = central eitv school. 0 = other 

RURAL 1 - rural school. (I = other 

BOOKS number ol books in the school's libnuy (000s) pet pupil 

SLAY peicentage of si\th grade students who ha\e not changed schools since the lust guide 

BYTEST mean grade 3 test score (percentage ol correct ansssers) m the school 

BI.ACKT percentage ot black teachers in the school 

PUPT pupils per teacher in the school 

EXP mean \cars ol experience of teachers in the school 

MA percentage of teachers with at least a master's degree in the school 

VERB mean vcibal test score of teachers in the school 

'Each student's test score is the simple aserage of the percentage of cm reel answets the student received on \eibal. 
nonverbal, reading, and mathematics tests. 

''Weighted least squares regressions. The weight used is |(N, x N„/(Ni + N,.)| wheie V (N„) is the numbci ol thud 
(sixth) grade students taking the test in the school. 

Source: Author's computations trom the fuftuihtv of Ofywrtunitv surve> data tapes 



substantially influences the other coefficients. This 
is done in column (2). While mean teacher experi- 
ence is now no longer significant, teacher \crhal 
ability is still positively, and the peicentage of black 



teachers ncgativcK. associated with a school s gam 
score. " M 

Does mobility ol students into anil out ol a school 
influence the amount ol teaming that goes on m the 
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school? Column (3) adds as an explanatory variable 
the percentage of sixth grade students in a school 
that spent their entire elementary school careers in 
the school (STAY). This variable does not prove to 
be statistically significant and its inclusion does not 
substantially influence any of the other coefficients 
in the model (compare columns (1) and (3)). 

Of course, as noted in the introduction, it would 
be desirable to confine the computation of the 
synthetic gain scores to students who hid always 
remained in the same elementary school. This is 
done in columns (4) and (5) where the gain score is 
now computed as the mean test score for sixth grade 
students who spent their entire school career at the 
school minus the mean test score for third grade 
students who spent their entire careers in the 
school. 24 

The estimated association between the percentage 
of black teachers and teachers' verbal aptitude 
scores and the synthetic gain scores in this restricted 
sample are quite similar to those found in the 
unrestricted sample. One new finding, however, is 
that the synthetic gain scores of these stayers are 
larger, the larger is the proportion of sixth grade 
students in the school who spent their entire careers 
at the school. Put another way. the more turnover 
there is in a school's student population, the lower 
the gain scores are for the students who remained at 
the school. 

The analyses reported in Table 2 group all 
students together. They do not permit us to address 
an issue that is of key importance to us — whether a 
teacher's race and verbal ability differentially 
influence the academic achievement of students of 
different races. To address this issue. Table 3 
reports estimates of selected coefficients from 
equations that were estimated separately for black 
and white students. In each case the synthetic gain 
score is now computed for each school using only 
data for students of the given race. 

Columns (1) for the white and black student 
samples report coefficients from equations specified 
identically to column (1) in Table 2. In this model, a 
higher percentage of black teachers in a school is 
associated with a lower synthetic gain score for 
white students but is not associated with a higher 
gain score for black students. Teacher verbal ability 
is positively related to gain scores for both groups of 
students and the magnitude of the relationship is 
about the same. Teacher experience has a payoff 
only for white students and having more teachers 



with advanced degrees enhances learning for black 
students, but perversely lowers it for white students. 

To estimate whether the effects of teacher verbal 
ability, experience and degree level vary for each 
group of students with the race of the teachers, 
expanded versions of the equation underlying 
columns (1). (3). and (5) of Table 2 were estimated 
that allowed for interactions. For example, in the 
case of column ( 1). the estimating equation became: 

GAIN,, = a„, + a^X,, + a : ,S, + a»,BYTEST„ 
+ tt 4| ((BLACKT l /l(H)) ! 'T l[i ) 
+ Q, | ((l-BLACKT l /ltH)))*T lNV + e„ (2) 

Here j. equal to 1 or 2. indexes the black or white 
student equation and T lM (T,\ V ) is the vector of 
characteristics of black teachers (white teachers) in 
school i. 

Selected coefficients from these models appear in 
columns (Iw), (3w) and (5w) for white students and 
columns (lb). (3b). and (5b) for black students. The 
estimates in column (1) for each group come from 
equations that are restricted versions of the 
equations that underlie columns (lw) and (lb) and 
hence one can test for each group whether t he- 
restrict ions are valid. Formal F tests suggests the\ 
are not.'" That is. we can reject the h\ pot he sis that, 
for each group of \tudents. the effects of all the 
teacher characteristics variables are the same for 
black and white teachers. 

The results in this table are striking. In most 
specification^ the percentage of students that are 
black does not affect either black or white students* 
gain scores. Higher verbal scores for black teachers 
are associated with higher gain scores for both black 
and white students. In contrast, white teachers' 
verbal scores matter only for white students. Higher 
white teacher experience levels are associated with 
higher gain scores for only white students and black 
teachers* experience levels do not appear to have 
any impact on either group of students* gain scores. 
Finally, while an increase in the percentage of black 
teachers in a school with at least a masters degree 
increases the gain scores of black students, an 
increase in the comparable percentage for white 
teachers again pet tersely is associated with lower 
gain scores. ^ 

Several extensions warrant brief reporting here.' 7 
First, when the analyses underlying tables 2 and 3 
were repeated separately for each of the four 
individual tests (verbal aptitude, reading, nonverbal 
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aptitude, and mathematics), teacher verbal ability 
was found to be positively associated with the 
synthetic gain scores on all four tests, with the 
magnitude of the relationship being smallest for the 
mathematics gain score. 

Second, as discussed in footnote 9. students were 
administered different tests in the third and the sixth 
grades. It is. therefore, not obvious that the metric 
we have chosen, the synthetic gain score, is 
neccssariK the correct outcome variable to use. : * To 
test for the sensitivity of our findings to the metric 
chosen, we recstimated the equations found in 
column 1 of Table 3 using first the difference in the 
logarithms of the mean sixth and third grade test 
scores in a school as the outcome variable and then 
the difference in the school's percentile rank on the 
sixth grade tests (among all schools in the sample) 
and its percentile rank on the third grade tests. :v We 
found that teacher verbal ability continued to be 
positive Is associated with the outcome variable in 
each of these specifications. 



C. Secondary School Analyses 

Table 4 contains weighted least square estimates 
of synthetic gain score equations for the high schools 
in our EEO sample. The outcome variable is now 
the mean percentage of correct answers of twelfth 
grade students in the school on their group of tests 
minus the mean percentage of correct answers of 
ninth grade students in the school on their group of 
tests. Because the sample sizes are smaller, fewer 
coefficients than in the elementary school sample 
prove to be statistically significant and fewer 
analyses are reported. 

In these high school data, higher verbal aptitude 
scores of teachers are associated, on average, with 
higher gain scores for white students, but not for 
black students. When teacher characteristics are 
broken down by race, white teachers' verbal 
aptitude scores appear to matter for both groups of 
students, but black teachers' verbal aptitude scores 
do not. Increasing the percentage of black teachers 
in a school with at least a masters degree is 



Table 4. School level \\nOictic earn score equations: Grades 9 to 12 (absolute value / statistics) 
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(0 7) 


4.161 
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() (H6 
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-0.021 (0.5) 


0.021 (0.8) 


BLACKS 


- 0.025 


(1 2) 


0.057 


(2.0) 


0.063 


C2) 


-0.002 (0.1) 


-0.017 (0.5) 


INMH 


(1 090 


(3.6) 


O.I 2^ 


(.VI) 


-0.139 


(3.3) 


-0.0S5 (2.5) 


-0.076 (2.3) 


MNHH 


0.059 


(12) 


- (1 044 


(0 6) 


0 056 


(O.S) 


-0.056 (1.0) 


-0.047 (0.8) 


PHONE- 


0.054 


(3.3) 


0.004 


(0.1) 


-(1.1X15 


(0.2) 


0.064 (3.2) 


0.064 (3.2) 


INCONU 


0.695 


(2.6) 


0.7|9 


(2.2) 


0.802 


(2.4) 


O.SSS (l.S) 


0.978 (2.0) 


1LNCH 


(1.007 


(0.4) 


- 0.043 


(1.5) 


-0.049 


(L7) 


0.023 (1.6) 


0.036 (1.7) 


ll.O 


0.773 


(2.1) 


0 694 


(1.5) 


0.741 


(16) 
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L488 (2,2) 


\U:1> 


(1.541 
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1 . 268 


(2 3) 


1.125 


(2-0) 
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-0.754 (1.2) 


CITY 


0.1 38 


(0 3) 


0.255 


(0.4) 


0.4% 


(0.8) 


0,204 (0.3) 


-0.060 (0.1) 


RURAL 


0.32S 


(0.7) 


-0.063 


(0.1) 


-0.031 


(0.1) 


0.556 (0.8) 


0.645 (LO) 


BOOKS 


0.02S 


(0.6) 


0.004 


(0.1) 


0.022 


(0.4) 


-0.031 (0,4) 


-0.097 (1.2) 


PL'PT 


(MINN 


(2.1) 


0 054 


(1.1) 
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(1.0) 


0.006(0 9) 


0.017 (0.2) 


BYTLST 


-0.556 (14.7) 


-0 621 (12.2) 
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(1 026 
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0.37K 



'See Tables 2 and * tor variable deluntions RYTFST now the mean pradc 9 test score (percentage of correct 
ansvvets) in (he school. 
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associated with higher gain scores for both black and 
white students, although again the percentage of 
white teachers with at least an MA degree is 
negatively associated with white students' gain 
scores. 

There is also evidence that white students' gain 
scores are positively associated with the percentage 
of students in the school that are black and 
negatively associated with the percentage of 
teachers that are black. In contrast, while black 
students* gain scores are not related to the per- 
centage of students in the school that are black, they 
do appear to be positively associated with the 
percentage of black teachers. That is. in the EEO 
data, other things held constant, black teachers do 
appear to improve the gain scores of black students 
at the high school level. 

IV. CAN SCHOOL AND TEACHER 
CHARACTERISTICS BE TREATED 
AS EXOGENOUS? 

A. Conceptual Issues 

Differences in school or teacher characteristics 
are not randomly determined across schools. 
Families choose where to live, and hence their 
children's schools, based on their own preferences 
and resource constraints/" Teacher characteristics 
depend upon factors such as the salaries teachers are 
offered, and the pecuniary and non pecuniary 
characteristics of the community in which the school 
is located. 31 These considerations suggest that fail- 
ure to treat teacher and school characteristics as 
endogenous may lead to biased estimates of their 
affects. K Yet to date, virtually all studies of teacher 
and school affects have treated these characteristics 
as exogenous. 33 

These biases might arise if the teacher and school 
characteristics used in the synthetic gain score 
equations (equations ( I ) and (2)) are correlated with 
the error terms in the equations due to an omitted 
variable problem. For example, suppose that we are 
interested in the effects of teachers' verbal aptitude, 
that we assume (for now) that teacher verbal 
aptitude actually does not influence gain scores, and 
that the omitted variable is a measure of the value 
that the parents of students place on education. 
Presumably parents who value education highly will 
invest more in their children at home (thus leading 
to higher gain scores) and will also reside in school 
districts that pay high salaries to attract and retain 



teachers with high verbal aptitude scores (if they 
believe, erroneously in our example, that high 
teacher verbal aptitude enhances learning). Other 
things held constant, estimation of equations (I) or 
(2) by least squares would yield a positive relation- 
ship between gain scores and teacher verbal aptitude 
even though we hve assumed (for now) that the true 
relationship is zero, 

The bias arises in our example because of the 
endogeneity of families' locational decisions coupled 
with our inability to fully control for unobserved 
variables that simultaneously influence students' 
gain scores and their families' locations (which in 
turn determined teachers' verbal aptitude). We 
address this problem below b\ using an instrumental 
variable estimation method to obtain instruments 
for the school and teacher characteristics variables, 
conducting formal statistical tests to ascertain which 
of these characteristics can be legitimately treated as 
being exogenous and which must he treated as 
endogenous, and then reestiniating the synthetic 
gain score equation using the original \ariables for 
the exogenous characteristics and the instru- 
mental variable estimates for the endogenous 
characteristics. 

B. Institutional Variable Approach 

Appendix Table A2 summarizes the equations we 
estimated to generate instruments tor the school and 
teacher characteristics \ariahles. These equations 
were estimated primarily to obtain instruments for 
the school and teacher \ariahlcs and thev should not 
be thought of as structural equations. In each case, 
the actual value of these sanahles were regressed on 
a set of characteristics of the families of these 
students at the school, a broader set of character- 
istics of residents of the county or SMSA in which 
the school was located, and an estimate of the 
starting teacher salarv in each school district/ 1 The 
county level sariables are suitable for uh" as 
instruments because they are less likely to he 
correlated with the error term in the gain score 
equations than are school or district level variables. 
The county and SMSA \ ariables w ere obtained from 
the 1965 City and County Databank and starting 
salaries were obtained from within -school district 
teacher salary equations that were estimated by us. 
using the indi' idual teacher data from the EEO. " 

The instruments obtained from the coefficients m 
Appendix Table A 2 were used to test whether the 
assumptions that all the teacher and school charac- 
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t eristics can be treated as exogenous are valid, using 
Durbin-Wu-Hausman specification tests in the 
following two-stage fashion. First, an expanded 
version of the model that appears in column (I) of 
Table 2 was estimated that included both the 
original values and the instruments for alt six teacher 
and school characteristics variables. The coefficients 
of the instruments for MA and PUPT each had / 
statistics that exceeded 1.9 in absolute value and a 
formal F test suggested that one can reject the 
lnpothesis that the vector of coefficients of the six 
instruments as a set are all zero, and hence that all of 
the school and teacher characteristics variables 
should be treated as exogenous/ These tests imply 
that at least MA and PUPT should be treated as 
endogenous in the estimation of this gain score 
equation. 

Second . a version of the model was estimated in 
which the instruments for MA and PUPT replaced 
the original values of these variables, but both the 
original values and the instruments for the other 
teacher and school characteristics (BOOKS. 
BLAC K I . KXP. VHRB) were included. None tit 
the eoethcients of the instruments for the latter four 
variables in this model had a t statistic that exceeded 
1.5 m absolute value and a lormal V test suggested 
that one cannot reject the hypothesis that the entire 
vector of these latter four instruments* coefficients 
aie all equal to zero. s This test implies that in a 
formal statistical sense it is legitimate to treat 
BOOKS. BI ACKT. F.XP. and VHRB as exogenous 
in the estimation of the gain score equation. 

As a result, the synthetic gain score equation 
found in column ( I ) of Table 2 was first reestimated 
with instruments used only tor PUPT and MA. A 



comparison of the coefficients of the school and 
teacher characteristics variables that were obtained 
when weighted least squares was used on the 
original data (column 1, Table 5) and when a 
weighted instrumental variable procedure was used 
with these instruments (column 2, Table 5) suggests 
that the percentage of black teachers continues to be 
negatively, and teachers* verbal aptitude positively, 
associated with the synthetic gain scores. Moreover, 
the magnitudes of the coefficients of the two 
variables are roughly the same in the two specifi- 
cations. Hence, if we take our endogeneity tests at 
face value, our findings about the roles that teacher 
ability and race play do not appear to be biased by 
endogeneity issues. 

We note, however, ihat treating BOOKS. 
BLACKT, E-XP. and VKRB as exogenous is not 
consistent with either our underlying economic 
model or the empirical evidence provided by- 
others. *' If one also treats BOOKS, BLACKT, 
HXP. and VHRB as endogenous and reestimated 
the model, the results in column 3 are obtained. 
While the estimated effect of teacher verbal ability 
increases substantially, the coefficient of the 
proportion of black teachers now switches sign and 
is statistically insignificant. Thus, our conclusion 
about the effects that teachers" skin color per si* had 
on students in the I%0s hinges on the accuracy of 
our specification tests. 4 " 

IV. SIMULATIONS 

Did teachers verbal ability and race matter in the 
l%()s? Our reanalysis of the EEC) data suggest that 
the answer to the first question is definitely yes. 



Table 5. Compaiison ot WIS and WIY estimates of school level 
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while the answer to the second question is much 
more sensitive to the statistical model used. On 
balance, increases in the verbal aptitude scores of 
both black and white teachers were associated, 
other things held constant, with higher synthetic 
gain scores. On balance, increases in the percentage 
of black teachers in a school were associated, other 
things held constant, with lower gain scores for 
white students at both the elementary and secondary 
level and higher gain scores for black students at the 
secondary level. The latter results do not always 
hold up, however, once one attempts to control for 
the endogeneity of teacher and school 
characteristics. 

A number of simulations that are summarized in 
Table 6 provide the reader with estimates of the 
quantitative importance of these characteristics in 
the weighted least squares analyses. These simu- 
lations make use of the estimates that were obtained 
separately for black and white students and, thus, 
that allowed the influence of teachers' character- 
istics to vary with the race of the teachers (Tables 3 
and 4). To given the render a sense of the mag- 
nitudes that follow, we note that the results in 
column 1 of Table 2 suggest that reducing class size 



by 10 students per teacher for elementary school 
students, would be associated with roughly a (1.8 
point increase in the students' gain scores. 

The mean percentages of black teachers in the 
sample were 4 (3) for white elementary (secondary) 
school students and 71 (77) for black elementary 
(secondary) school students, respectively. The first 
three simulations ask what the impact would have 
been on students if the percentage of black teachers 
had been 10 percentage points higher? Those 
reported in row I hold constant the mean values of 
black and white teachers' other characteristics (MA. 
EXP, VERB) at their sample values. However, 
since the mean values of black and white teachers' 
characteristics differed, especially for VERB, this 
first simulation provides no information on whether 
teacher skin color per ve would matter if other 
teacher characteristics were the same. The simu- 
lations reported in rows 2 and 3 address this issue. 
The former assumes that all teachers have the mean 
sample value of black teachers' characteristics, while 
the latter assumes that all teachers have the mean 
sample value of white teachers' characteristics 

In fact, the three sets of simulations yield quite 
similar findings. Depending upon the particular 



Table 6. Estimated changes in gain scores from changes in teacher race and verbal ability 



Simulation 

(!) Increase I he percentage of black 
teachers by 10 (holding constant 
all other teacher characteristics) 

(2) Increase the percentage of black 
teachers by 10 (assuming black 
and white teachers both had the 
mean characteristics of black 
teachers) 

(3) Increase the percentage of black 
teachers by 10 (assuming black 
and white teachers both had the 
mean characteristics of white 
teachers) 

(4) Increase the verbal aptitude 
scores of black teachers by 10 
points 

(5) Increase the verbal aptitude 
scores of white teachers by 10 
noints 
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"Estimated change is statistically significant different from zero at the 0.05 level of significance, two (ail lest 
"Estimated change is statistically significantly different from zero at the 0. Ill level ol significance, iwo-tail tesi 
Source: Authors* calculations from data in Tabic I and coefficient estimates in the indicated columns from Table 3 (lot 
grades 3 to 6) and Table 4 (for grades 9 to 12). 
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elementary school equations used in the simulation, 
increasing the percentage of black teachers by 10 
percentage points is estimated to reduce the syn- 
thetic gain scores of white elementary school 
students by roughly 0.7 to 1.2 points and to reduce 
the gain scores of black elementary school students 
by roughly 0 to 0.5 points. This should be contrasted 
to mean gain scores of 5.86 and -4.42 for the two 
groups, respectively. At the high school level, the 
increase would reduce the synthetic gain scores of 
white students by roughly 0.9 to 1.2 points, but 
increase the gain scores of black students by roughly 
0.2 to 0.4 points. This should be contrasted to mean 
gain scores of 0,86 and -0.53. respectively. 

The mean verbal aptitude test scores of white 
elementary (high school) teachers and black 
elementary (high school) teachers were 81.13 
(81.17) and 73.54 (80.20) respectively for white 
stut'ents and 78.76 (76.62) and 64.03 (66.91) 
respectively for black students. The simulations 
reputed in the fourth and fifth rows of the table ask 
\\hui the impact on the synthetic gain scores would 
have been if all black teachers* verbal aptitude 
scores were increased by 10 points (row 4) and if all 
white teachers* scores were increased b\ 10 points 
(row 5). The standard deviation of teachers' verbal 
aptitude scores is approximately 20 points for each 
ot the white and black teacher samples, so a 10-point 
increase represents a change of about one-half of a 
standard deviation. Given that white teachers taught 
primarily white students and black teachers taught 
pmnarik black students in the l%()s. one should 
expect that improving unly the verbal test scores of 
teachers ot one race would influence prima rilv the 
gain scores ot students of that rice. This in fact 
occurs. 

Improving the verbal aptitude scores of black 
teachers b\ 10 points is estimated to increase the 
svnthetic gain scores of white elementary school 
students bv roughly 0.14 points and of black 
elemental"} school students by roughly 0.9 points. 
Similar I v increasing the verbal aptitude scores of 
white elementary school teachers by 10 points is 
estimated to increase the synthetic gain scores of 
white elementary students by between 0.24 and 0.50 
points, but to have very little, or even a small 
negative effect, on black elementary school 
students' scores. At the high school level, neither 
white nor black students gains scores are predicted 
to change very much in response to either a 
change. 



VI. CONCLUDING REMARKS 

What conclusions should one come away with 
from our findings? Teacher race and verbal aptitude 
did matter in the 1960s in the sense that they were 
associated with synthetic gain scores! Verbal 
aptitude scores of teachers nationwide have declined 
substantially during the last two decades. 41 If one 
adds a quadratic term in ability to our gain score 
equation, one finds that the marginal affect of ability 
increases as ability declines, so tht the payoff to 
improving teachers' verbal scores probably is even 
higher today. 42 

We must caution, however, that our results are 
tor synthetic gain scores in the mid-1960s. The mid- 
1960s was a period of change for schools in the 
United States and this may reduce the appropriate- 
ness of our synthetic gain score approach, which 
assumes that the same "fixed" unobservable vari- 
ables identically affected the test scores of students 
in different grades at a school at a point in time. 
Furthermore, synthetic gain scores, especially for 
the high school data, are dependent on drop-our 
rates. Oilier things equal, assuming the drop-outs 
come from the lower tail of the test score distri- 
bution, the higher the drop-out rate in a school 
between the ninth' and twelfth grades, the higher the 
twelfth grade score will be and thus the larger the 
gain score will be. Put another way. our results may 
be subject to a form of selection bias. While no data 
exist on drop-out rates in the EEC), results we report 
elsewhere based on analyses of data from High 
SchfKtl and ttcyond suggest that teacher race and a 
proxy for teacher ability do not influence a student's 
drop-out probability, so perhaps this is not a 
problem. n 

Changes in student test scores over their school 
careers are not the sole outcome of interest. Do 
teacher verbal aptitude and race influence college- 
going behavior, college completion rates, or post- 
educational labor market outcomes? Do they 
influence noneeonomic outcomes such as attitudes 
students hold towards individuals from other racial 
groups? These questions cannot be answered with 
the EEC) data, although one recent study on a 
related topic did suggest that school quality 
measures do affect labor market outcomes. 44 

The racial and ethnic distributions of students and 
teachers are different today than they were 30 years 
ago when the EEC) survey was undertaken. So. 
perhaps, are the attitudes about and expectations of 
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black and white students and teachers towards 
members of the other race. Although our reanalyses 
of the EEO data found in some specifications that 
increasing the percentage of black teachers in a 
school, other things held constant, was associated 
with lower white student gain scores at both the 
elementary and secondary levels and higher black 
student gain scores only at the high school level, 
these findings were sensitive to assumptions about 
the endogeneity of teacher and school character- 
istics. Furthermore. estimated relationships 
obtained from "educational production function" 
analyses are often not the same across studies and 
there is nothing that guarantees that these relation- 
ships would hold today. 45 For example, one recent 
study of the actual gain scores between the second 
and sixth grades on reading and vocabulary tests for 
low-income black students in Gary, Indiana found 
that, holding other characteristics of teachers 
constant, black elementary school teachers did 
enhance the performance of these black students. 4h 
Clearly, before drawing any policy conclusions 
about the importance of teacher skin color, per se, it 
is necessary to replicate our analyses using more 
recent data. 47 

Conceptually, however, the issues we have raised 
should not be ignored. Minority teachers may. on 
average, improve the academic performance of 
black students but adversely influence the academic 



performance of white students today. Teacher 
verbal aptitude may matter today, on average, both 
for white and minority teachers. To the extent that 
the latter tend to have lower test scores, hiring 
minority teachers with lower verbal aptitude scores 
than white teachers may adversely affect the gain 
scores of both minority and white students. The case 
for expanding the number of minority teachers in 
public education rests on distributative as well as 
efficiency considerations. However, contemporary 
empirical evidence of the type we have presented 
surely should be part of the policy debate. 
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NOTES 

1 . On the declining pool of potential minority teachers see Cole ( 198G). Irvine (W88). Bcrger ( 1990). 
and Zinn (1990). For evidence that minority teachers are more likely to fail the National Te ache is 
Examination (NTE). and. in Texas, the Texas Educational Assessment ol Minimum Skills 
Examination, see Cole (1986), Ferguson (1991a). and Thernstrom (1991). 

2. See, for example. Irvine (1988). 

3. On the potential cultural bias of tests and the validity of the NTE see National Research Council 
(1989). Ayers and Quails (1979). Avers (19KK). Haney ct al. (19S7). Darling-Hammond and Wise 
(1983). and Sheehan and Marcus (1978). Studies that show that teachers* "ability", as measured h\ 
test scores, does affect student academic achievement include Armor (1972). Boardman, etal. ( I97K). 
Coleman etal. (1966), Ferguson ( 1991a; 1991b). Sheehan and Marcus (1977). and Strauss and Sawyer 
(1986), In contrast Summers and Wolfe (1977) find no evidence that teachers* "ability" matters. 

4. These studies include Aaron and Powell (1982). Aloia. Maxwell and Aloiii ( 1981 ). Banks (19SS). 
Barnes (1979), Baron (1985). Beady and Hansell (1981). Braun (1976). Brophv ( 1981). Brown et al. 
(1970). Byalick and Bersoft (1979). Carew and Lightfoot ( 1979). Coales (1972)*. C ooper et til. ( 1975). 
Cooper and Tom (1984). Cornbleth and Korth (1990). Dusek and Joseph (1983). Eaves (1975). 
Feldman (1986). Gottlieb (1964). Haller (1985). Heath (1971). Holiday (1985). Irvine (1985: 1986: 
1990). Jackson and Cosea (1974). Leiter (1976), Mathis (1976), Meier et al. (1989). Natriello and 
Dornbusch ( 1983). Simpson and Erickson ( 1983). Sizemore (1981 ). Tobias a al. (1983). and West and 
Anderson (1976). 

5. See. for example. Alexander et aL (1987). Bridge etal. (1979). Crain and Mahard (1978). Crain ci 
«/. (1982), Cunningham (1975). Darkenwald (1975). Farkas et al. (1990). Ferguson (1991a; 1991b). 
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Click (1971). Mux nor (197(1). Murnane (1975). Ohberg (1972), Pascarella et at. (!979) Rossel and 
Hawley (1983). Sanders (1982). Shechan and Marcus ( 1977). St John (197!). Touliatos et aL (1977), 
and Yando et at. (1971). 

ft. See. for example. Bowles and Levin ( 1968). Cain and Watts ( 1970) and the set of papers published 
in Mosieller and Movnihan (1972). 

7. See. for example.' Coleman et at. (I9ftft). Armour (1972), Hanushek (1972), Jencks (1972). 
Thernstrom (1991) has recentlx reminded people of this finding, 

8. Sec Hanushek (1986). 

9 Students in each grade were administered a battery of subject and aptitude tests. For each subject, 
different tests were administered to students in each grade so that one cannot infer anything about the 
absolute amount students in a school learn by comparing say, the mean third grade and mean sixth 
grade test scores. Hoxxcxer, one can infer something about how much students in a school were 
learning in relative terms by comparing gain scores across schools. So, for example, if the mean third 
tirade score on a test v. as S0°«. in each of two schools and the mean sixth grade scores were 80% and 
90°.. in the two schools, respectively, the implication is that the students learn more between the third 
and sixth grade in the second school. 

10 A major finding of the Coleman Report was how segregated by race schools were in 1966. For 
example, almost SO",, of all while pupils in the first and I w elfin grades attended schools that were 90 to 
100",. white, while more than ft5".i (S5°«) of black students in the first (twelfth) grade attended 
schools that were between 9(1 and 100".. black (Coleman et <//.. 1966, p. 3). In the research that 
follows, gain scores are computed by race for a school if at least one student of that race arc present in 
the base x ear grade. Since weighted least square analyses are used (with the weights based on the 
number of students in the racial group in the grades in the school), schools with only a few students 
horn a racial group are given vcrv little weight in the race-specific analyses. 

II. While onlx a small traction of white students were taught by black teachers in 1966. a greater 
traction ot black students were educated b\ white teachers. For the nation as a whole, the average 
black ek-mentarv (secondary) student attended a school in which 35°/,. (41%) of the teachers were 
white (Coleman ft ol . 1 96ft. p. 3). 

12 Pnor analyses ot how teachers are sorted across school districts and how they decide whether to 
remain in the profession have been undci taken. See. for example. Antos and Rosen ( 1975). Ferguson 
( I 'Nib), jnd Murnane and Olsen ( 199(1). Howes er. Lhrenberg and Brewer (1994) is the first study to 
Ue.it teacher charactenstics as endogenous in the estimation of educational production functions. 

1 3 The l.l.O data tapes ..re very poorly documented and considerable effort had to he expended by us 
to "clean'' the data. A data appendix, available from us on request, discusses a number of the 
pi obi ems we faced and the actions we took. 

14. Although over 60.000 teachcis weic surveyed m the original i.liO survey, the data set we received 
lioni the National Aichixc contained information on onlx 44,193 teachers and came with a notice that 
ix\o te.ichci tiles were missing When teacher data was missing, it appeared to be missing for all the 
teachers in a school and all ot these schools jire necessarily excluded from our analyses. This 
restriction alone reduced the total number of schools in the sample from about 4000 to 2075. This 
implies that most of the schools that tailed to report teacher data were relatively small. 

15 Any school that enrolled both white and black students, will appear in both the white student and 
the black student samples, lest scores for suidents ot other ethnicities/races, primarily Native 
Americans. Hispanics. and Asian Americans, are used to compute the mean test scores in the overall 
sample, but not in the black or white samples. 

16. Part ot the reason for the small high school sample is that many ninth graders attended junior high 
schools and thus were not enrolled in schools with twelfth grades. A comparison of the descriptive 
statistics loi the samples actually used m our analyses (Table I ) and the largest samples available if we 
ignoie our need tor leachci characteristics data, suggests that the two samples were quite similar, for 
both element aiy and secondary schools, in terms ot their base year test scores, synthetic gain scores, 
and proportions of black teachers and students. 

17. The high percentages ot black students and teachers m our sample occur because black schools were 
over-sampled in the original h.LO survey . No sample weights were found by us in the data so we could 
not control lor problems relating to choice-based sampling in our statistical analyses. 

15. Gain scores can be negative because, as discussed in footnote 9. the tests administered in each grade 
were different. 

19. Requiring black leacheis to have more education than while teachers at schools with while students 
max reflect either discrimination m hiring or a compensating differential for their lower test scores. 

20. The mean values ot the students' characteristics in a school were very similar for third and sixth 
grade students in each school and the latter were used in these analyses. Family income data were 
actuallx not available in the El.O data and had lo be estimated by us. We computed the occupational 
distribution (at the one-digil level) ot lathers in each school and then used this distribution and data 
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on the 1970 median male earnings in each occupation in the census region (4) in which the school was 
located to obtain an estimate of family incomes in each school. 

21 . A 10 percentage point increase in teachers' verbal aptitude scores is equivalent to a change of a little 
under one standard deviation in the overall distribution of teachers* scores. 

22. Appendix Table Al contains mean test score level equations for the four elementary and secondary 
grades that are specified similarly to column 1 of Table 2. While these test level equations suffer from 
omitted variable bias (see the introduction), it is clear that many ot the measured variables that 
influence the synthetic gain scores also influence the test score levels. 

23. An alternative way to handle the problem is to treat BYTEST as endogenous, obtain ;tn 
instrumental variable estimate for it. and then reestimatc the gain score equation in column ( 1 ) using 
the instrumental variable. We obtained an instrument by regressing BYTEST on the characteristics of 
the school's students, their families and the community in which the school was located (the X.) and 
similar variables for the larger county or SMSA in which the school was located. When the gain score 
equation w*s reestimated using this instrument, the coefficient of the instrument proved to be 
insignificant but the coefficients of BLACKT (-0.063) and VERB (0.066) were very similar to the 
corresponding coefficients found in column I of Table 2 and both remained statistically significant. 
Consequently, in what follows, we treat BYTEST as exogenous. 

24. In some schools, either all students failed to report whether they spent their entire careers in the 
school or all students reported that they had not. Hence, the smaller sample sizes in columns (3) 
through (5). 

25. For the white students sample, the computed statistic is F( 3.776) = 5.87 and for the hlack student 
sample it is F(3.49l) = 7.93. Both of these values exceed the 0.99 critical values of F(3.120) or F(3.*) 
of 3.95 and of 3.78, respectively. 

26. One cannot infer the effect of black teachers per se in these models from the coefficients ot 
BLACKT alone since this variable also interacts with the other teacher characteristics (see equation 
(2)). We conduct simulations, however, using all of these characteristics and their coefficients in 
Section V. 

27. Tables of the results of these analyses are available from the authors. 

28. We are grateful to Fran Blau for suggesting this point to us. 

29. In each case, the mean third grade value of the variable (logarithm of mean test score or percentile 
rank on the third grade test) appears on the right hand side of the equation. 

30. See Tiebout (1956). 

31. Again, see Antos and Rosen (1975) and Ferguson (1991b). 

32. Evans, Oates, and Schwab (1992) make a related point in the context of estimating "peer group 
effects'* on drop-out rates. While they treat peer group measures, such as the percentage t-f 
disadvantaged students in a school as endogenous, they do not explore the influence of teacher oi 
school characteristics. 

33. Ehrcnberg and Brewer (1994) is the exception. Ferguson (1991b) treats teacher and school 
characteristics as exogenous when he estimates district -level educational production functions, but 
then goes on to show how these characteristics vary with underlying socioeconomic and demographic 
variables. 

34. We are grateful to Marshall Smith, currently Undersecretar\ at the U.S. Department of Education, 
for helping us to develop an algorithm to identify the county or SMSA in which each school in the 
EEO survey was located. The data appendix provides details. 

35. For each district, we regressed the logarithm of teacher s salary on the teacher's experience, and 
degree level, using all teachers we observed in the district as observations. The estimated intercepts 
from these equations are estimates of each district's logarithm of starting salary. 

36. See Russell Davidson and James M. MacKinnon ( 1993). pp. 237-242 for a more formal treatment. 

37. The computed F statistic was F(6.681) - 6.56 which exceeds the critical value of roughK 2.80 for 
rejecting the hypothesis at the 0.99 level. 

38. The computed F statistic was F(4.683) = 1.44 which is less than the critical value ot roughly 3.78 for 
rejecting the hypothesis at the 0.99 level. 

39. See Antos and Rosen (1975) and Ferguson (1991b). 

40. Two other extensions warrant brief reporting here. First, when one eliminates the estimated starting 
salary in a district from the instrumental variable equations (on the grounds that it is endogenous), 
one obtains virtually identical results to those reported in Table 6 and the text. Second, when one 
repeats the analyses eliminating the base year test score from the gain score equations (on the grounds 
that it too may be correlated with unobserved variables), one again obtains a similar pattern ot results. 
That is, Durbin-Hausman-Wu tests suggest that onty BOOKS. PUPT. and MA should be treated as 
endogenous and the estimated VERB and BLACKT coefficients obtained from such a specification 
are very similar to those obtained in column 2 of Table 2. 

41. See Murnanc et al. (1991), Chapter 2. 
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42. When VERB squared is added to the mode! specified in column I of Table 2 and then the expanded 
equation reestimated. the coefficient of VERB and VERB squared, respectively become, with the 
absolute values of their /statistics in parentheses, 0.366 (2.5) and -0.002 (1 .9). Although the marginal 
effect of VERB declines as aptitude increases in this model, it remains positive until VERB reaches 

^91.5. 

43. Ehrcnberg and Brewer (1994). 

44. See Card and Krueger (1992). 

45. See Hanushek (1986). 

46. Hanushek (1992). 

47. The data used by Ferguson ( 1991) on Texas school districts would be extraordinarily valuable if 
characteristics of teachers and students by race/ethnic group for each district could be made available, 
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Appcfldlx Table Al. School level mean test score equations: grades 3. 6, 9 and 12 (absolute value i statistics) ' 




GRADE 3 


GRADE 6 

■ - ■- ■■ - - 


GRADE 9 


GRADE 12 


■ ■ 

INTER 


38.998 (2.8) 


25.031 (8.3) 


17.646 ( 3.1) 


4.119 (1.0) 


FEM 


-0.018 (0.9) 


0.053 (2.6) 


-0.094 (3.6) 


-0.031 (1.9) 


BLACKS 


-0.075 (7.3) 


-0.095 (8.9) 


-0.059 (1.7) 


-0.047 (1.9) 


FNHH 


-0.056 (2.9) 


-0.085 (4.2) 


-0.221 (5.3) 


-0.219 (6.7) 


WKJL-IUI 

MiNnM 


-0.071 (2.6) 




-0.139 (1.8) 


-0.138 (2.2) 


PHONE 


0.077 (5.8) 


0.129 (9.2) 


0,051 (2.0) 


0.077 (3.7) 


INCOME 


0.059 (0.3) 


0,200 (1.0) 


1.634 (3.9) 


1.511 (4.6) 


FLNCH 


-0.02K (2.5) 


-0.039 (3.2) 


0.009 (0.3) 


0.002 (0. 1 ) 


FED 


0,854 (3.7) 


1.386 (5.7) 


1.440 (2 


4) 


1.515 (3.2) 


MED 


ll.fi.VJ (2.2) 


0.606 (2.0) 


0.329 (0,5) 


0.799 (1.4) 


CITY 


-3,161 (6.6) 


- 1.764 (3.6) 


-2.074 (2 


5) 


0.470 (0 8) 


RURAL 


-0.131 (03) 


1.039 (2.1) 


0,091 (0.1) 


0.502 (0.8) 


BOOKS 


0.003 (0.1) 


0.007 (0 2) 


0,069 (1.0) 


0.064 (1.2) 


PUPT 


-0.134 (3.9) 


-0.121 (3.3) 


-0.053 (0.8) 


0.074 (1.4) 


BLACKT 


0.029 ( 


2.6) 


-0.028 (2.4) 


0,015 (0 5) 


-0 01 1 (0.5) 


EXP 


0.133 (3.8) 


0.111 (3.1) 


0.016 (0 


2) 


0,071 (1.0) 


MA 


-0.017 (1.5) 


-0.013 (1.1) 


-0,010 (0.6) 


-0.030 (2.4) 


VERB 


0.048 ( 


2.1) 


0.108 (4.5) 


0.152 (2 9) 


0.165 (4.1) 


n 


969 




969 


256 




256 


R : 


0.701 




0.886 


0.881 




0.932 


'All variables are defined as in Table 2. 










Appendix Table A2. Estimation of instrumental variable 


equations' (absolute 


value ot t statistics) 




BOOKS 


PUPT 


BLACKT 


EXP 


MA 


VERB 


INTER 


-8.139 (0 5) 


56,248 (3.2) 


1.392 (2.5) 


- 16.275 (1.1) - 


S7.792 (1.2) 


3 355 (0.1) 


FEM 


0.029 (1.8) 


-0.021 (1.2) 


0.000 ( 0.07) 


-0.017 (1.1) 


0.054 (1.1) 


0.004 (0.1) 


BLACKS 


-0.022 (4.0) 


0.027 (4.4) 


0.008 (38.6) 


-0,010 (2.1) 


0.027 (1.6) 


-0.118(13.2) 


FNHH 


-0.005 (0.3) 


-0.018 (1.0) 


0.002 (2.7) 


0.033 (2.2) 


0.104 (2.1) 


0.047 (l.S) 


MNHH 


-0.029 (1.4) 


0.010 (0.5) 


0.003 (3.6) 


-0.355 (2.0) 


-0,198 (3.3) 


-0.158 (5.9) 


PHONE 


-0.007 (0.6) 


-0,006 ( 0.4) 


-0.001 (3,3) 


0.013 (1.2) 


0.045 (1.3) 


(I.0S4 (4.4) 


INCOME' 


-0.369 (1.8) 


0.276 (1,3) 


0.030 (4.5) 


- 0.175 (1.0) 


1.109 (2.0) 


-0 402 (1.3) 


FLNCH 


H.OOfi (0.5) 


-0.023 (2.2) 


-0.001 (1.3) 


-0.017 (1 9) 


0.021 (0.7) 


0.020 (1.3) 


FED 


0.421 (2.2) 


-0.260 (1,0) 


0.002 (0 3) 


-0.198 (1.1) 


-0.561 (1.0) 


0 193 (0.6) 


MED 


0.194 (0.7) 


-0.261 (1,0) 


-0.015 (1.7) 


0.381 (1.7) 


1.284 (1,7) 


0.527 (1.3) 


CITY 


-0.922 (2.1) 


0.991 (2.1) 


-0.120 (8.1) 


-0.707 (1.9) 


0 403 (0.3) 


.1 325 (5 0) 


RURAL 


0.016 (0.3) 


-0.921 (2.1) 


0.007 (0.5) 


-0.538 (1 5) 


0.382 (0.3) 


-0.146 10.2) 


ESAL 


2.066 (1.8) 


-3.880 (1.8) 


-0.146 (2.2) 


3.824 (2 3) 


7.335 (1.3) 


6.708 (2 2) 


POP 1, 


0.003 (2.3) 


-0.006 (3.2) 


-0.000 (1.8) 


0.001 (0.6) 


-0.0O6 (1.4) 


0.008 (3.1) 


POPDEN 


0.000 (1.3) 


-0.001 (2.1) 


-0.000 ( 6.1) 


0.000 (0.5) 


0.005 (4.9) 


0.001 (2.2) 


PURB 


-0.004 (0.3) 


0.013 (1.0) 


0.001 (2.9) 


-0.004 (0.4) 


0.104 (2.7) 


- 0.062 {} 0) 


PBLK 


0.003 (0.2) 


0.051 (3.4) 


0.002 (5.0) 


-o.om (0.3) 


-0.074 (I 8) 


-0 095 (4.3) 


P650 


-0.357 (3.7) 


-0.088 (0.9) 


-O.OIt) (3.0) 


0.191 (2.3) 


-0.031 (0 1) 


0.346 (2.3) 


MEDU 


-0.541 (1.5) 


0.969 (2.5) 


-0.013 (1.0) 


0.514 (1.6) 


-0.415 (0.3) 


0.341 (0.5) 


PGHS 


0.058 (0.9) 


-0.103 (1.6) 


0.002 (0.8) 


-0.124 (2.2) 


0.010 lO.I) 


--0.070 (0.7) 


SEN" 


-0.01 1 (2.0) 


0.002 (3.6) 


0.0(H) (2.0) 


0.(MN (19) 


0.043 (2.8) 


0 027 (3.2) 


PWHC 


(1.072 (1.6) 


-0.088 (1.8) 


0.003 (1.8) 


0.056 (1.3) 


0.164 (1.2) 


(1.008 (0 1 ) 


MINC" 


-0.686(2.2) 


0.858 (2.5) 


-0.056 (5.2) 


- 0.737 (2.7) 


5.227 (5.7) 


1 629 (3 3) 


PUBAS* 


-0.033 (2.3) 


0.027 (1.8) 


0.001 (1.9) 


-0.034 (2.7) 


-0.093 (2 2) 


0.042 (1 9) 


PSEE 


0.724 (0.2) 


-4.46M (1.5) 


0.251 (2.7) 


-3.881 (1.6) 


6.749 (0.9) 


-3.3*6 (0.8) 


n 


889 


888 


1129 


1129 1129 


1129 


*- 


0.148 


0.207 


0.870 


0.134 


0.156 


0.644 



ESAL 
POP 



estimated starting salary ot teachers in the district 
area population in 1960 
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POPDEN area population per square mile in I960 

PURB per cent area population living in urban areas in I960 

PB1-K per cent area population that is black in I960 

P650 per cent area population that is age 65 or older 

MEDU median school years completed for area population that is age 25 or older in 1960 

POHS per cent area population 25 or older in 1960 that completed high school 

SEN total area school enrollment in 1960 

PWHC per cent area adults in white collar jobs in 1960 

M1NC median family income in the area in 1960 

PUBAS per cent area families on public assistance in 1964 

PSEE per student expenditures in the area in I960 



All other variables are defined in Table 2. 
'Weighted least squares regressions. The weights used in each case are the number of teachers taking the test. 
Sources: (a) Authors* computations from the Equality of Educational Opportunity survey data tapes, (b) Authors* 
computations from the CVrv and County Databook data tape. The "area'* refers to the SMS A in which the schtx>l is 
located, if the school is in SMSA and the county in which the school is located for schools outside SMSAs, 
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